首页> 外文OA文献 >A novel feature selection approach for biomedical data classification.
【2h】

A novel feature selection approach for biomedical data classification.

机译:一种用于生物医学数据分类的新颖特征选择方法。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This paper presents a novel feature selection approach to deal with issues of high dimensionality in biomedical data classification. Extensive research has been performed in the field of pattern recognition and machine learning. Dozens of feature selection methods have been developed in the literature, which can be classified into three main categories: filter, wrapper and hybrid approaches. Filter methods apply an independent test without involving any learning algorithm, while wrapper methods require a predetermined learning algorithm for feature subset evaluation. Filter and wrapper methods have their, respectively, drawbacks and are complementary to each other in that filter approaches have low computational cost with insufficient reliability in classification while wrapper methods tend to have superior classification accuracy but require great computational power. The approach proposed in this paper integrates filter and wrapper methods into a sequential search procedure with the aim to improve the classification performance of the features selected. The proposed approach is featured by (1) adding a pre-selection step to improve the effectiveness in searching the feature subsets with improved classification performances and (2) using Receiver Operating Characteristics (ROC) curves to characterize the performance of individual features and feature subsets in the classification. Compared with the conventional Sequential Forward Floating Search (SFFS), which has been considered as one of the best feature selection methods in the literature, experimental results demonstrate that (i) the proposed approach is able to select feature subsets with better classification performance than the SFFS method and (ii) the integrated feature pre-selection mechanism, by means of a new selection criterion and filter method, helps to solve the over-fitting problems and reduces the chances of getting a local optimal solution.
机译:本文提出了一种新颖的特征选择方法来处理生物医学数据分类中的高维问题。在模式识别和机器学习领域已经进行了广泛的研究。文献中已经开发了数十种特征选择方法,这些方法可以分为三大类:过滤器,包装器和混合方法。过滤器方法在不涉及任何学习算法的情况下应用独立测试,而包装器方法需要用于特征子集评估的预定学习算法。过滤器方法和包装器方法分别具有各自的缺点,并且彼此互补,因为过滤器方法的计算成本低且分类可靠性不足,而包装器方法往往具有较高的分类精度,但需要很大的计算能力。本文提出的方法将过滤器和包装器方法集成到顺序搜索过程中,旨在提高所选特征的分类性能。提出的方法的特征在于(1)添加预选择步骤以提高搜索具有改进分类性能的特征子集的效率,以及(2)使用接收器工作特征(ROC)曲线表征各个特征和特征子集的性能在分类中。与被认为是文献中最佳特征选择方法之一的常规顺序正向浮点搜索(SFFS)相比,实验结果表明(i)所提出的方法能够选择具有比分类更好的分类性能的特征子集。 SFFS方法和(ii)集成的特征预选机制,借助新的选择准则和过滤器方法,有助于解决过拟合问题,并减少了获得局部最优解的机会。

著录项

  • 作者

    Peng, Y; Wu, Z; Jiang, J;

  • 作者单位
  • 年度 2010
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号